Learning to Race: Experiments with a Simulated Race Car

نویسندگان

  • Larry D. Pyeatt
  • Adele E. Howe
چکیده

Experiments with a Simulated Race Car Larry D. Pyeatt Adele E. Howe Colorado State University Fort Collins, CO 80523 email: fpyeatt,[email protected] URL: http://www.cs.colostate.edu/~fpyeatt,howeg Abstract We have implemented a reinforcement learning architecture as the reactive component of a two layer control system for a simulated race car. We have found that separating the layers has expedited gradually improving competition and multagent interaction. We ran experiments to test the tuning, decomposition and coordination of the low level behaviors. We then extended our control system to allow passing of other cars and tested its ability to avoid collisions. The best design used reinforcement learning with separate networks for each behavior, coarse coded input and a simple rule based coordination mechanism. Introduction Autonomous agents require a mix of behaviors, i.e., responses to di erent stimuli. This is especially true in situations where there are other agents present or where the environment is otherwise nondeterministic. For an agent to be e ective in its environment, it must have a large repertoire of behaviors and must be able to coordinate the use of those behaviors e ectively. Reactive systems have been favored for applications requiring quick responses, such as robotic or process control applications(Arkin 1994). Unfortunately, it is di cult to tune, decompose and coordinate reactive behaviors while ensuring consistency. In this paper, we present a case study of designing a mostly reactive autonomous agent for learning to perform a task in a multi-agent environment. The agent controls a race car in the Robot Automobile Racing Simulator (RARS) system (Timin 1995). To be successful, the agent must quickly respond to its environment and must have mastered several skills: steering, accelerating, and passing. To address these requirements, our basic agent architecture (Figure 1) adopts the common two-layer control structure: low level behaviors (implemented Copyright c 1998, American Association for Articial Intelligence (www.aaai.org). All rights reserved. Partial State Vector Sensors Effectors Actions Environment

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of the maintenance of root canal anatomy in severely curved resin blocks using RaCe and Flexmaster rotary systems

Comparison of the maintenance of root canal anatomy in severely curved resin blocks using RaCe and Flexmaster rotary systems Dr. S. Khedmat* - Dr. M. Vahdatifar** *- Associate Professor of Endodontics Dept. - Faculty of Dentistry – Tehran University of Medical Sciences. **- Endodontist. Background and Aim: Various rotary systems are available on the market, therefore selection of a proper instr...

متن کامل

Learning Coordinated Behaviors for Control of a Simulated Race Car

category: Control, navigation and planning This work is not in submission or press to any other conference. Abstract The demands of rapid response and the complexity of many environments make it diicult to decompose, tune and coordinate re-active behaviors while ensuring consistency. We hypothesize that complex behaviors should be decomposed into separate behaviors resident in separate networks...

متن کامل

Reinforcement Learning with Decision Trees

We present a decision tree based approach to function approximation in reinforcement learning. We compare our approach with table lookup and a neural network function approximator on three problems: the well known mountain car and pole balance problems as well as a simulated automobile race car. We find that the decision tree can provide better learning performance than the neural network funct...

متن کامل

مقایسه تأثیر سیستمهای چرخشی‌ Profile و Race بر آناتومی اولیه کانالهای مشابه‌‌سازی شده دارای انحنای شدید

Background and Aim: Recent advances in technology have permitted manufacturers to produce endodontic files from NiTi alloy and rotary systems. According to manufacturers, these files maintain original canal shape better than hand files and reduce treatment time. The aim of this study was to compare a new rotary system (Race) with an older one (Profile) in maintaining the original shape of curve...

متن کامل

Decision Tree Function Approximation in Reinforcement Learning

We present a decision tree based approach to function approximation in reinforcement learning. We compare our approach with table lookup and a neural network function approximator on three problems: the well known mountain car and pole balance problems as well as a simulated automobile race car. We find that the decision tree can provide better learning performance than the neural network funct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998